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Abstract. The analysis of multiparticle quantum states is a central problem in 
quantum information processing. This task poses several challenges for experimenters 
and theoreticians. We give an overview over current problems and possible solutions 
concerning systematic errors of quantum devices, the reconstruction of quantum states, 
and the analysis of correlations and complexity in multiparticle density matrices. 


1. Introduction 

The analysis of quantum states is important for the advances in quantum optics and 
quantum information processing. Many experiments nowadays aim at the generation 
and observation of certain quantum states and quantum effects. For instance, in 
quantum simulation experiments thermal or ground states of certain spin models should 
be observed. Another typical problem is the demonstration of advanced quantum 
control by preparing certain highly entangled states using systems such as trapped ions, 
superconducting qubits, nitrogen-vacancy centers in diamond, or polarized photons. 

All these experiments require a careful analysis in order to verify that the desired 
quantum phenomenon has indeed been observed. This analysis does not only concern 
the final data reported in the experiment but in fact, many more questions have to be 
considered in parallel. Did the experimenter align the measurement devices correctly? 
Have the count rates been evaluated properly in order to obtain the mean values of the 
measured observables? Such questions are relevant and, as we demonstrate below, ideas 
from theoretical physics can help the experimenters answer them. 

Many experiments in quantum optics can be divided in several steps (see also 
Fig. 0 . In the beginning, some experimental procedures are carried out and 
measurements are taken. The results of the measurements are collected as data. These 
data are then processed to obtain a quantum state or density matrix p, which is often 
viewed as the best description of the “actual state” generated in the experiment. This 
quantum state can then be further analysed, for instance, its entanglement properties 
may be determined. 

In this article, we show how ideas from statistics and entanglement theory can be 
used for analysing the transitions between the four building blocks in Fig. First, we 
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Figure 1. The analysis of many experiments in quantum physics can be divided 
into several steps, from the experimental procedures to the verihcation of quantum 
mechanical properties of the generated states. 


consider the transition from the experimental procednres to the data. We show that 
applying statistical tests to the data can be nsed to recognize systematic errors in the 
experimental procednres, snch as a misalignment of the measnrement devices. Then, we 
consider the reconstrnction of a qnantnm state from the experimental data. We explain 
why many freqnently nsed state reconstrnction schemes, snch as the maximnm-likelihood 
reconstrnction, lead to a bias in the resnlting state. This can, for instance, resnlt in a 
fake detection of entanglement, meaning that the reconstrncted state is entangled, while 
the original state, on which the measnrements were carried ont, was not entangled. We 
also show how snch a bias can be avoided. Finally, we discnss the characterization of 
qnantnm states on a pnrely theoretical level. Assnming a mnltiparticle density matrix 
g we show how its entanglement can be characterized and how the complexity of the 
state can be qnantihed nsing tools from information geometry and exponential families. 

2. Systematic errors in quantum experiments 

In this hrst part of the article, we discnss what assnmptions are typically nsed in 
qnantnm experiments. The violation of these assnmptions leads to systematic errors 
and we show how these systematic errors can be identihed using statistical methods and 
hypothesis tests. 

2.1. Assumptions underlying quantum experiments 

Before explaining the assumptions, it is useful to discuss a simple example. Consider 
a two-photon experiment, where a quantum state should be analysed by performing 
state tomography. For that, Alice and Bob have to measure all the nine possible 
combinations of the Pauli matrices (Ji ® aj for i,j G {x,y,z}. In practice, this can 
be done as follows: Alice and Bob measure the three Pauli matrices cr 3 ,,(jy,cr^ by 
measuring the polarization in different directions, getting the possible results -|- and 
—. These results correspond to the projectors on the eigenvectors of the observable. 
By combining the results, they obtain one of four possible outcomes from the set 

—,—h,-}. The measurement is repeated N times on copies of the state, 

where the outcome occurs A^++|p times etc. From that, one can obtain the relative 
frequencies F++|p = N^+\ij/N and estimate the expectation values as (cTj ® crj)exp = 

_ \ij)/N. In addition, the expectation values of the marginals 
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o'i <8) (Jo (here and in the following, we set cxo = 1) can be determined from the same data. 
Given all the experimental results, Alice and Bob may then reconstruct the quantum 
state via the formula 



( 1 ) 


i,je{0,x,y,z} 


This simple quantum state reconstruction scheme is often called linear inversion. It 
assumes that the observed frequencies equal the probabilities, we will discuss its 
advantages and disadvantages below. For the moment, we just use it as an example 
to illustrate the dehnitions and discussion concerning systematic errors in experiments. 

Now we can formulate the assumptions that lead to the statistical model typically 
used in quantum experiments. We consider a scenario where one actively chooses 
between different measurements (e.g., the ai^aj), each having a hnite number of results. 
We use the label s to denote the measurement setting and r to denote the result. It is 
important to note that, if in an experiment using the setting s one registers the result r, 
then this outcome r|s is not just treated as a classical result. In addition, each outcome 
is tied to an operator Mr\s (e.g., the projectors onto the eigenstates corresponding to the 

results {++, H—, —h,-} of ai®aj) that serves as the object to compute probabilities 

within quantum mechanics: If the underlying quantum state is characterized by the 
density operator p, then the probability to observe r|s is given by P(r|s; g) = tr(pMr|s). 
Therefore, this quantum mechanical description is one of the essential ingredients to 
connect the observed samples with the parameters of the system that one likes to infer. 
Knowledge about this description can come from previous calibration measurements or 
from other expertise that one has acquired with the equipment. But one thing should 
be obvious: If one assumes a description M^i*, which deviates from the true description 
in the experiment then things can go terribly wrong and these type of errors are 
the ones that we like to address in the following. 

Clearly, such deviations are presumably present in any model, but they are 
usually assumed to be small. However, considering the increased complexity of present 
experiments, one can ask the question, whether or not these deviations show up 
signihcantly in the data. Well known examples, like different detection efficiencies 
or dark-count rates in photo-detectors or non-perfect gate hdelities for single-qubit 
rotations preceding the readout of a trapped ion, could support this scepticism. 
However, these effects are hardly ever considered in the description of Mj,p. 

Let us complete the list of assumptions. Most often each experiment of setting s 
is repeated N times, which are assumed to be independent and identically distributed 
trials. So one further assumes that one always prepares the same quantum state p, 
measures the same observables Mrp, and that both are completely independent [|j 

f This means that both, measurements and states are described by the corresponding A^-fold tensor 
products. While such a property can be inferred for the states with the help of the de Finetti theorem [^, 
one should be aware that its exchangeability requirements do not apply to experiments where one 
actively measures first all the s = 1 measurements, followed by all s = 2 measurements and so on. 
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Clearly, also in all these steps there can be errors, for instance, due to drifts in the 
measuring devices or dead-times in detectors coming from previous triggering events. 
However, if everything works as planned, then it is not necessary anymore to keep track 
of the individual measurement results, since every information that can be inferred 
about the state parameters is already included in the count rates Nr\s of the individual 
measurement results r|s. Their probability is then given by a multinomial distribution 
Mult[iV, P{r\s] p)] for each setting, which is the distribution characterizing N repetitions 
of independent trials. Here, the single event probabilities P(r|s; p) = tr{gMr\s) are 
calculated according to quantum mechanics and these are the only parameters that 
depend on the quantum state. 

Finally, the whole collection of distributions for all measurement settings is the 
exact parametric model used for most quantum experiments. These distributions are 
given by the set 

■^QM = |p({iV^|s}r,s; p) = JJ^Mult [iV,tr(pM^I^)] , for all g with g > 0,tr(p) = l|, 

5 

( 2 ) 

and the observed probabilities are assumed to be an element of this set. In the following, 
we discuss how the validity of this model can be tested. 

2.2. Testing the assumptions 

How can one test in this framework whether the assumed measurement description is 
correct for the experiment? As a hrst try, we could intersperse the experiment with 
test measurements, in which one prepares previously characterized states. But such an 
option seems very cumbersome, independent of problems like how to characterize the 
test states in the hrst place and to ensure that they are well prepared in between the 
true experiment. In contrast, we want to do it more directly and this becomes possible, 
at least partially, by exploiting that quantum states only allow a restricted set of event 
probabilities. 

Let us hrst discuss the idea for the case where one has access to the true 
event probabilities Po(^l'S) which can be attained from the relative frequencies Fr\s = 
Afr\s/N —)■ Po(r|s) in the limit N — )■ cx). We want to know whether these observed 
probabilities are at all compatible with the assumed measurement description. This boils 
down to the question whether there exists a quantum state po with Po(r|s) = tr(poAfrp) 
for all r, s. Since quantum states must respect the positivity constraint p > 0, not all 
possible probabilities are accessible: For instance, if one measures a qubit along the 
X, y, z directions, its corresponding probabilities will be constrained by the requirement 
that the Bloch vector must lie within the Bloch ball. To make this more general, assume 
that we have a certain set of numbers Wr\s such that the observable Xlrp^ 
has no negative eigenvalues and is, therefore, positive semidehnite. If the probabilities 
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Po(’"|s) can indeed by realized by a qnantnm state, one has 

w- Po = = ^Wr|str(^oA^r|s) = tT[go{'^ Wr\sMr\s)] > 0, (3) 

r,s r,s r^s 

where the ineqnality holds becanse both operators are positive semidehnite. Thns, if 
everything is correct one mnst get a non-negative valne for w ■ Pq > 0. Conseqnently, 
whenever one observes w ■ Pq < 0, one knows that something must be wrong and that 
the description of the measurements Mr\s has some flaws. This type of inequalities is 
similar in spirit to Bell inequalities for local hidden variable models or entanglement 
witnesses for separable states [^|^. Let us point out that the above inequalities are 
necessary and sufficient. So, indeed any Po(r|s) which cannot originate from a quantum 
state, can be detected by appropriately chosen coefficients Wr\s by tc ■ Pq < 0 |^. 
Finally, we add that besides the positivity, some other constraints for the measurement 
description are conceivable. For instance, in the example of state tomography from 
above, the marginal ® ctq) should not depend on whether it has been derived from 
the measurement ax ® Ox or ax®ay. This can be formulated as a linear dependency of 
the form Yhr s'^r\sAIr\s = 0 and the corresponding constraint even becomes an equality 
w • Po = 0. 

Note that, with this test we ask the question whether the data Po(r|s) fit at all 
to the assumed measurement model Mx\s- But it should be clear that this approach 
can never serve as a proof that everything is correct in the experiment. For example, 
one can consider again the Bloch ball, where the measurement model assumes perfectly 
aligned measurements in the x,y,z directions, but in the true experiment one measures 
in slightly tilted directions which distorts the resulting Bloch ball. All states from this 
tilted Bloch ball, which lie outside the standard sphere, will be detected by the above 
method as being incompatible with the assumed model. For all other states, however, 
we do not see the difference because they are still consistent with the model. 

Finally, let us address the point that we only collect count rates in the experiment. 
Since the relative frequencies Pj-p are only approximations to the true probabilities 
P(r|s), it is clear that a similar inequality as Eq. (|^ does not need to hold anymore for 
w ■ F, even if everything is correct. One would expect, however, that larger negative 
values are much less likely. This is indeed the case and is made more quantitative via 
Hoeffding’s inequality [^. 

This inequality states the following: Consider N independent, not necessarily 
identically distributed, bounded random variables X* G [ai,bi]. Then the sample mean 
X = '^iXi/N satishes 

ProbIX - E(V) < -d < exp (4) 

for all f > 0, where E(X) denotes the expectation value of X. In practice, the main 
statement of this inequality is that for N independent repetitions of an experiment, the 
probability of deviations from the mean value by a difference t scales like exp(— 
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It is important to stress that this result uses no extra assumptions, like N being large, 

at all. 

For our case, we can use Hoeffding’s inequality to bound the probability of observing 
data that violate positivity constraints as in Eq. ([^. More precisely, we can derive the 
following statement [^: For all distributions compatible with quantum mechanics, the 
probability to observe frequencies such that w ■ F < —e is bounded by 

/ 2e‘^N\ 

Probp [tn ■ F < -e] = ^ < exp - ^ ^ 

w-F<—e 

with Cl = and Wmaxp, Wminp being the extreme values of {wq 

Again, this can be interpreted as showing that if everything is correct, then the 
probability of Ending a violation of the positivity constraint is exponentially suppressed. 

We can use this statement as follows: Suppose that we should reach a conclusion 
whether the observed data are “compatible” or “incompatible” with our assumed model. 
Of course, if we say “incompatible”, we do not want to reach this conclusion too often, 
if indeed everything is perfect. For definiteness, we may assume that the probability of 
claiming incompatibility if everything is correct should be at maximum a = 1%. We then 
use Eq. to deduce the threshold value that we need to beat, | log(Q;)|/2A^. 

If we now carry out the experiment and register click rates with w ■ F < —Sq, we know 
that there was at most a. a = 1% chance that we would have registered such badly 
looking data, if everything is correct. Since this would be really bad luck we would 
rather say “incompatible”, and assume that some systematic error was present [§} 

In practice, this test can be used to detect systematic errors in various scenarios: 
In ion trap experiments, a typical systematic error comes from the cross talk between 
the ions, i.e. the fact that a laser focused on one ion also influences the neighbouring 
ions. This phenomenon can be detected with the presented method [^. The second 
application are Bell experiments: In these experiments, the choice of the measurements 
on one party should ideally not influence the results of the other party and a violation of 
this condition completely invalidates the result of a Bell test. Again, this non-signalling 
condition can be formulated as linear constraints on the probabilities and this can be 
tested with the presented method. In all these applications, the determination of the 
vector w characterizing the positivity constraint or the linear constraint can be done as 
follows: One splits the observed data into two parts. From the first part one determines 
the w leading to the maximal violation of the respective constraint for the first half. 
Then, one applies this w as a test to the second part of the data. If the violations 
of the constraint are only due to statistical fluctuations, the respective w for the two 
parts of the data are uncorrelated and the test will not find a significant violation of the 
constraint. 

§ Since one typically likes to leave the choice of appropriate levels of a to the reader one can also 
report the p-value of the observed data: It is the smallest a with which we would have still said 
“incompatible” with the test. 
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Let us point out that the mathematical framework just described is called a 
hypothesis test (^, in which one tests the null-hypothesis Nq: “compatible”, against 
the alternative A: “incompatible”. The special property of such a test is that there 
is an asymmetry about the two types of errors that can occur. As already explained, 
our concern is that, when saying “incompatible”, then this statement is more or less 
correct. The other error can occur when we respond “compatible” to incompatible data. 
Naturally, this error characterizing the detection strength of our test, ideally, should also 
be made small. However, it is not possible to reduce both errors equally simultaneously. 
Nevertheless, since we cannot detect all possible systematic deviations from the assumed 
model, anyway, one should not be too euphoric about the statement “compatible” in 
this sense. 

Note that, while the presented test has been build up by hrst deriving specihc 
inequalities for event probabilities and then equipping it with the necessary statistical 
rigour to arrive at an hypothesis test, one can also take the other direction, by using 
techniques which are known to be good for hypothesis tests and apply them to the 
special statistical model of the quantum experiments. We have done this for the so- 
called generalized likelihood-ratio test and details can be found in Ref. |^. Finally, 
other tests for systematic errors can be found in Refs. jS 

3. Performing state tomography 

In the previous section, we have seen that care has to be taken when making the 
measurements on the quantum system. In this section, we show that the interpretation 
of tomographic data, such as the reconstruction of the quantum state, has to be done 
with care, too. Otherwise, one introduces yet another class of systematic errors. 

3.1. Problems with state estimates 

We are used to summarize experimental data by an estimate together with an error 
margin. In quantum state tomography this corresponds to an estimate for the density 
matrix together with an error region. So, the hrst question is how one can obtain 
an estimate q for the experimentally prepared density matrix Pexp from the observed 
frequencies Fr\s. The simplest approach is to use linear inversion, that is, the method 
given in Eq. Q. This has, however, at hrst sight some disadvantages: Due to statistical 
huctuations the observed frequencies are not equal to the true probabilities and this 
leads to the consequence that the reconstructed “density matrix” will typically have 
some negative eigenvalues. This makes the further analysis of the experiment, e.g. the 
evaluation of entanglement measures, not straightforward. In order to circumvent this, 
one often makes a density matrix reconstruction by setting 

p = argmax a). (6) 

( 7>0 

Here, one optimizes a target function F over all density matrices a and the optimal a 
will obviously be a valid density matrix. Examples for this type of state reconstruction 
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Figure 2. Demonstration of the bias for different state estimators. A given state 
gexp having 80% hdelity with the GHZ state was used to sample the distribution 
of the estimator g in state space, and from these samples the fidelities with the GHZ 
state were computed. The maximum-likelihood (ML) and least-square (LS) estimators 
clearly underestimate the fidelity, while the linear inversion (LIN) is unbiased. The 


figure is taken from Ref. 11 . 


are the maximum-likelihood reconstruction or the least-squares reconstruction, both are 
frequently used for experiments in quantum optics. 

An important property of such an estimator is the question whether it is biased or 
unbiased. This means the following: The underlying state Pexp leads via the multinomial 
distribution to a probability distribution over the frequencies The estimator p is a 
function from the observed data (the frequencies F^p) to the state space. In this way, 
the original state Pexp induces a probability distribution over the estimators g, and one 

7 

can ask whether the expectation value of this equals the original state, E[p] = Pexp- If 
this is the case, the estimator is unbiased, otherwise it is biased. It must be stressed, 
however, that biased estimators are not necessarily useless or bad, as it all depends on 
the purpose the estimator is used for. 

For quantum state reconstruction one can prove the following: Any state 
reconstruction scheme that yields a density matrix from experimental data will be 
biased, i.e., on average, the reconstructed state g will not be the state used in the 

but the 


7 ^ Pexp- A proof of this statement was given in Ref. 11 


experiment, 

following example demonstrates that the problem in finding an unbiased estimator comes 
from the fact that the quantum mechanical state space is bounded by the positivity 
constraint. Consider a coin toss where we are interested in the modulus of the difference 
between the probability of obtaining heads or tails, A = \ph—pt\ = 1 —2min{pft, 1—ph}. 
This quantity cannot be negative, so also an estimator should not be negative. Let us 
assume that an estimator A is unbiased, E[A] = Aexp- Then, for any experimental data 
that could come from a fair coin (Aexp = 0) we cannot have A > 0 since this would 
imply E[A] = ^^(l/ 2 )'^( )A(fc) > 0, where k denotes the number of occurrences of 
heads. On the other hand, any possible number of heads and tails is compatible with 
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a fair coin. So, the estimate A for any data must be 0. Then in particular ]E[A] = 0, 
which means that A is a biased estimator whenever the coin is not fair. 

Apart from this theoretical argument, the question arises whether this effect plays a 
signihcant role in practical quantum state reconstruction. Unfortunately, this is the case 
and this effect can causes substantial hdelity underestimation or spurious entanglement 
detection in realistic scenarios 11 . This problem applies to the established schemes for 


reconstructing a density matrix, in particular to the maximum-likelihood method 12 


and the constrained least-squares method [^. So, how large is the bias? For example, 
in a tomography of a four-qubit GHZ state with hdelity 0.8, when reconstructing from 
a total number of 8100 samples, the state from a maximum-likelihood estimate has 
a hdelity of 0.788 ± 0.010 11 , i.e., the hdelity is systematically underestimated (see 
also also Fig. [^. Such an underestimation may be considered to be unfortunate, 
but acceptable. However, it was also demonstrated that maximum-likelihood and 
least-square methods tend to overestimate the entanglement. In fact, for a clearly 
separable state the reconstructed states can be always entangled, thus leading to 
spurious entanglement detection [^. This is not acceptable for many experiments. 

A way to avoid the bias is to accept that the reconstructed density matrix is not 
always a valid quantum state and can have negative eigenvalues. The simplest unbiased 
method is linear inversion explained above. More generally, if Mj.\s are the operators 
corresponding to the measurement outcomes in a complete tomographic measurement 
scheme, then one can hnd operators Xj-p such that for all states g = Xlrp g) 

holds, generalizing Eq. 0i The estimate given by linear reconstruction is then 
g = where Fr\s are the relative frequencies of the result r for setting 

s. This estimate is unbiased but it comes with the price that in all realistic scenarios 
g has some negative eigenvalues and hence it is not a valid density matrix. Depending 
on the intended use of the reconstructed density matrix this may be problematic, but it 
was shown in Ref. 11 that entanglement measures or the Fisher information can still be 
estimated. In addition, we stress that the eigenvectors corresponding to these negative 
eigenvalues are randomly distributed in the following sense: If we choose a rank one 
projection |Q;)(a| independently of the data, then the probability that tr(p|Q;)(Q:|) < —e 
is exponentially suppressed, as can be seen from the inequality in (|^. 


3.2. Problems with error regions 

Any report of an experiment has to equip the reported results with error bars. In the 
case of a density matrix, this will be a high-dimensional error region. When specifying an 
error region, one hrst has to decide between the Bayesian framework and the frequentistic 
framework. A Bayesian analysis gives a credible region, which has the property that 
with high probability the actual state is in this region. A frequentist’s analysis gives 
a conhdence region, which is a map from the data to a region in state space such 
that with high probability the region contains the actual state. There is a long debate 

II The new operators W|s may be necessary, since the can be overcomplete or not orthogonal. 
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in mathematical statistics which method is appropriate, but most of the subsequent 
discussion is independent of this dispute. 

Before discussing the advantages and disadvantages of an error region, it is 
important to remember, that the variance does in general not give an appropriate 
error region. This occurs in particular if the underlying distributions are far from 
being Gaussian. But for state tomography, the data is sampled from a multinomial 
distribution, typically with a very low number of events. Indeed, in many experiments 
the number of clicks per measurement outcome is about ten, but sometimes even below 
one. Also the method of bootstrapping may yield an inappropriate error region. In 
bootstrapping, one uses an estimate g for the state (parametric bootstrapping) or 
the empirical distributions of the outcomes of the measurements (non-parametric 
bootstrapping) in order to estimate the variance of the estimate. This estimate is usually 
obtained by Monte Carlo sampling from the corresponding distributions. There is no 
particular reason that this should be a good error region, and it was also demonstrated 
that the most commonly used schemes yield invalid error regions. 

Methods to obtain valid error regions both in the Bayesian 14 and in the 
frequentistic framework 15 have been suggested, however, they turn out to be 


notoriously difficult to compute. But even when it is possible to achieve a proper error 
region, one has to keep in mind that the size of the error region scales with the dimension 
of the underlying Hilbert space, i.e., exponentially with the number of qubits. This 
makes it very difficult to perform state tomography of a large system with a reasonable 
sized error region. Fortunately, in many situations the error region for the state is not 
of uttermost importance. Often one is only interested in certain scalar quantities like a 
measure of entanglement or the fidelity with the (pure) target state. In this cases it is 
possible to infer an appropriate confidence region directly from the data, without taking 
the detour over an error region for the density operator. This is particularly simple, if 
the quantity of interest is linear in the density matrix, e.g., the fidelity with a pure state 
F = ('0|p|'^). One can again use Hoeffding’s tail inequality in order to obtain a lower 
bound Fi on the fidelity. The promise is then that P{Fi > (t/’|Pexp|t/’)) < 1% for any 
state Pexp- A general method to provide such confidence regions for convex functions, 
like the bipartite negativity or the quantum Fisher information, has been introduced in 
Ref. 


11 


4. Analysing density matrices 

In the last section of this article, we assume that a valid multiparticle density matrix 
g is given and the task is to analyse its properties. Naturally, many questions can be 
asked about a density matrix, but we concentrate on two of them. First, we consider 
the question whether the state is genuinely multiparticle entangled or not. We explain 
a powerful approach for characterizing multiparticle entanglement with the help of so- 
called PPT mixtures and semidefinite programming. Second, we consider the problem 
of characterizing the complexity of a given quantum state and explain an approach using 
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exponential families. For example, in this approach a state that is a thermal state of 
a Hamiltonian with two-body interactions only, is considered to be of low complexity 
and the distance to these thermal states can be considered as a measnre of complexity. 
The underlying techniques also allow to characterize pure states which are not ground 
states of a two-body Hamiltonian. 

4-1. Characterizing entanglement with PPT mixtures 

4-1.1. Notions of entanglement — Before explaining the characterization of 
multiparticle entanglement, we have to explain some basic facts about entanglement 
on a two-particle system. The dehnition of entanglement is based on the notion of local 
operations and classical communication (LOCC). If a quantum state can be prepared by 
LOCC, it is called separable, otherwise it is entangled. For pure states, this just means 
that product states of the form |0) = |a) ® \(4) are separable and all other states (e.g. 
the singlet state \'4)~) = (|01) — |10))/-\/2) are entangled. If mixed states are considered, 
a density matrix q is separable, if it can be written as a convex combination of product 
states, 

Q = ^PkQ)i®dh, (7) 

k 

where the pk form a probability distribution, so they are non-negative and sum up to one. 
Physically, the convex combination means that Alice and Bob can prepare the global 
state by hxing the joint probabilities with classical communication and then preparing 
the states and pf separately. The question whether or not a given quantum state is 
entangled is, however, in general difficult to answer. This is the so-called separability 
problem (^[^. 

Many separability criteria have been proposed, but none of them delivers a complete 
solution of the problem. The most famous separability test is the criterion of the 
positivity of the partial transpose (PPT criterion) [^. For that, one considers the 
partial transposition of a density matrix g = Yhijki Qij,kiV){j\ ® l^)(^l) given by 

® |/c)(^|- (8) 

ij,kl 

In an analogous manner, one can also dehne the partial transposition with respect 
to the second system. The PPT criterion states that for any separable state g the partial 
transpose g'^^, (and consequently also has no negative eigenvalues and 

is therefore positive semidehnite. So, if one hnds a negative eigenvalue of then 
the state g must necessarily be entangled. The PPT criterion solves the separability 
problem for low dimensional systems (that is, two qubits or one qubit and one qutrit) [^, 
but in all other cases the set of separable states is a strict subset of the PPT states. 
The entangled states which are PPT are of great theoretical interest: It has been shown 
that their entanglement can never be distilled to pure state entanglement, even if many 
copies of the state are available. This weak form of entanglement is then also called bound 
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entanglement and bound entangled states are central for many challenging questions in 
quantum information theory. 

The characterization of entanglement becomes significantly more complicated, if 
more than two particles are involved. Let us consider three particles (A, B, C). First, 
a state can be fully separable, meaning that it does not contain any entanglement and 
is of the form = |a) 0 \/3) ® I7). If a state is entangled, one can further ask 
whether only two parties are entangled or all three parties. For instance, in the state 
| 0 bs) _ \2p~)AB ® \l)c fhe parties A and B are entangled, but C is not entangled 
with A or B, therefore the state is called biseparable. Alternatively, if all parties are 
entangled with each other, the state is called genuine multipartite entangled . For the 
simplest case of three two-level systems (qubits) it has been shown that even the genuine 
multipartite entangled states can be divided into two subclasses, represented by the GHZ 
state \GHZ) = (|000) -|- |lll))/-\/2 and the W state \W) = (|001) -|- |010) -|- |100))/-\/3. 
These subclasses are distinguished by the fact that a single copy of a state in one class 
cannot be converted via LOCC into a state in the other class, even if this transformation 
is not required to be performed with probability one [^. 

The classification of entanglement for pure states can be extended to mixed states 
by considering convex combinations as in Eq. ([^. First, a mixed state is fully separable, 
if it can be written as a convex combination of fully separable states 

/" = ® 2 I G) (9) 

k 

and a state is biseparable, if it can be written as a mixture of biseparable states, which 
might be separable with respect to different partitions, 

pbisep _ ( 10 ) 

The different notions of entanglement in the multipartite case and the different 
bipartitions that have to be taken into account imply that the question whether a 
given mixed multipartite state is entangled or not is extraordinarily complicated. 


J 1 ..I. 2 . The approach of PPT mixtures 


genuine multiparticle entanglement makes use of so-called PPT mixtures 17 


A systematic approach for characterizing 

Instead 


of asking whether a state is a mixture of separable states with respect to different 
partitions as in Eq. (10), one asks whether it is a mixture of states which are PPT for 
the bipartitions 

( 11 ) 


^PPtmix _ + P2Q^^\aC + 


Since the separable states are a subset of the PPT states, any biseparable state is also 
a PPT mixture. This means that if a state is no PPT mixture, then it must be genuine 
multipartite entangled (see also Fig. [^. 

At first, it is not clear what can be gained by this redefinition of the problem. First, 
the condition for PPT mixtures is a relaxation of the definition of biseparability and it 
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Figure 3. Schematic view of the states which are PPT mixtures and the biseparable 
states for three particles. There are three possible bipartitions, and the corresponding 
sets of states which are separable or PPT for the bipartition. The figure is taken from 

Ref. [^. 


might be that the conditions are relaxed too much, implying that not many states can 
be detected by this method. Second, it is not clear how the criterion for PPT mixtures 
can be evaluated in practice and whether this is easier than evaluating the conditions for 
separability directly. In the following, however, we will see that the question whether a 
state is a PPT mixture or not can directly be checked with a technique called semidehnite 
programming. Furthermore, the approximation to the biseparable states is rather tight, 
and for many families of states the property of being a PPT mixture coincides with the 
property of being biseparable. 

4-1.3. Evaluation of the criterion — Let us discuss the evaluation of the condition for 
PPT mixtures. For that, we need to introduce the notion of entanglement witnesses. In 
the two-particle case, an entanglement witness W is an observable with the property that 
the expectation value is positive for all separable states, tr(p®®P>V) > 0. This implies 
that a measured negative expectation value signals the presence of entanglement. In this 
way, the concept of an entanglement witness bears some similarity to a Bell inequality, 
where correlations are bounded for classical states admitting a local hidden variable 
model, while entangled states may violate the bound. 

How can entanglement witnesses be constructed? For the two-particle case a simple 
method goes as follows: Consider an observable of the form 

W = P + Q^^, ( 12 ) 

where P > 0 and Q > 0 are positive semidehnite operators. Using the fact that 
Tr{XY'^^) = Tr{X'^^Y) for arbitrary operators X, Y, we hnd that for a separable state 
rr(Wp®®P) = Tr(Pp®®P) -b > 0, since has to be PPT. Therefore, the 

observable W is an entanglement witness, which may be used to detect the entanglement 
in states that violate the PPT criterion. 
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This construction can be used to decide whether a given three-particle state is a 
PPT mixture or not. For that, consider the optimization problem 

minimize TrlpW) 

subject to: W = Pi + = P 2 + = P 3 + and 

Fj > 0 for i = 1, 2, 3 and 

1 > Qi > 0 for i = 1,2,3. (13) 

The constraints guarantee that the observable W is of the form as in Eq. ( [I^ for any of 
the three bipartitions. This means, that if a state is a PPT mixture as in Eq. 0 , the 
expectation value Tr(p>V) has to be non-negative. On the other hand, one can show 
that if a state is not a PPT mixture, then the minimization problem will always result in 


a strictly negative value 17 . In this way, the question whether a state is a PPT mixture 


or not, can be transformed into a optimization problem under certain constraints. 

The point is that the optimization problem belongs to the class of semidefinite 
programs (SDP). An SDP is an optimization problem of the type 


minimize 

Xi 




CiXi 


subject to: Fq -|- ''^^XiFi > 0, 


(14) 


where the Cj are real coefficients defining the target function, the Fj are hermitean 
matrices defining the constraints and the Xi are real coefficients which are varied. This 
type of optimization problem has two important features 18 . First, using the so-called 


dual problem one can derive a lower bound on the solution of the minimization, which 
equals the exact value under weak conditions. This means that the optimality of a 
solution found numerically can be demonstrated. In this way, one can prove rigorously 
by computer whether a given state is a PPT mixture or not. Second, for implementing 
an SDP in practice there are ready-to-use computer algebra packages available and 
therefore the practical solution of the SDP is straightforward. 


4- 1 . 4 . Results — Concerning the characterization of PPT mixtures, the following 
results have been obtained: 


First, the practical evaluation of the SDP in Eq. (13) can be carried out easily 


with standard numerical routines. A free ready-to-use package called PPTMixer is 


available online 19 , and it solves the problem for up to six qubits on standard 
computers. For a larger number of particles, the numerical evaluation becomes 


difficult, but analytical approaches are also feasible 17,22 


For many families of states, the approach of the PPT mixtures delivers the strongest 
criterion of entanglement known so far. For many cases it even solves the problem of 
characterizing multiparticle entanglement. For instance, three-qubit permutation- 
invariant states are biseparable, if and only if they are PPT mixtures 20 . The 
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same holds for states with certain symmetries, like GHZ diagonal states or fonr- 


qubit states diagonal in the graph-state basis 21,22 


Nevertheless, the approach of PPT mixtnres can not detect all mnltiparticle 
entangled states. There are examples of genninely entangled three-qnbit and three- 
qntrit states, which are PPT mixtures 23,24 . For an increasing dimension and 


number of particles one can even show that the probability that a given multiparticle 
entangled state can be detected by the PPT mixture approach decreases 


25 


This finding, however, is in line with the observation that also in bipartite high¬ 
dimensional systems no single entanglement criterion detects a large fraction of 
states 1^ . 

The value A/'(p) = —Tr(p>V), that is, the amount of violation of the witness 
condition is a computable entanglement monotone for genuine multiparticle 
entanglement 22 . It can be called the genuine multiparticle negativity, as it 


generalizes the entanglement measure of bipartite negativity. 

An interesting feature of the PPT mixer approach is that it can also be evaluated, if 
only partial information on the state g is available. Namely, if only the expectation 
values (Ai) of some observables Ai are known, one can add in the SDP in Eq. (13) 
that the witness W should be a linear combination of the measured observables 
W = Yli ^i^i- If can be shown that this is then still a complete solution of the 
problem, meaning that the SDP returns a negative value, if and only if all states 
that are compatible with the data (Aj) are not PPT mixtures. 


4-2. Characterizing the complexity of quantum states 

Besides the question whether a given multiparticle quantum state is entangled or not, 
one may also be interested in other questions about a reconstructed quantum state g. 
For instance, one may ask: Is the given state is a ground state or thermal state of a 
simple Hamiltonian? In the following, we will explain how this question can be used to 
characterize the complexity of a many-body quantum state. 


4.2.1. Exponential families — First, one can consider the set of all possible two-body 
Hamiltonians. For multi-qubit systems they are of the form 

^2 = 5^ (15) 

i,a 

where a^a is the Pauli matrix a a acting on the i-th qubit. This Hamiltonian H 2 contains, 
apart from the identity, single-particle terms and two-particle interactions. However, no 
geometrical arrangement of the particles is assumed and the two-particle interactions 
are between arbitrary particles and not restricted to nearest-neighbour interactions. We 
also denote the set of all two-particle Hamiltonians by 7^2, and in a similar manner one 
can define the sets of fc-particle Hamiltonians "Hfc. 











Analysing multiparticle quantum states 


16 


Given the set of fc-particle Hamiltonians, we can define the so-called exponential 
family of all thermal states 


Qk = {exp{Hk} with Hk G Tik}, (16) 

where the normalization of the state has been included into the Hamiltonian via the 
term ul. 

If a given quantum state is in the family Qk for small k, then one can consider it to 
be less complex, since only a simple Hamiltonian with few parameters are required to 
describe the interaction structure. One the other hand, if a state is not in the exponential 
family, one can consider the distance 


DkigWQk) ■= inf D{g\\T]) 

rj^Qk 


(17) 


with D{g\\ri) = tr[plog(p)] — tr[plog(77)] being the relative entropy, as a measure of the 
complexity of the quantum state. The optimal r] is also called the information projection 
gk and one can show that this gk is the maximum likelihood approximation of g within 
the family Qk (^. Below, we will explain several further equivalent characterizations 
which can help to solve the underlying minimization problem. 

This type of complexity measure has been hrst discussed for the case of classical 
probability distributions in the context of information geometry 28 . The measure Di 


is also known as the multi-information in complexity theory 29 . For classical complex 


systems, these quantities have been used to study the onset of synchronization and chaos 
in coupled maps or cellular automata 27 . For the quantum case, this measure and its 


properties have been discussed in several recent works 30 -33 


At this point, it is important to note that in the quantum case as well as in the 
classical case the quantity Dk does not necessarily decrease under local operations 
31,32. Simple examples for this fact follow from observation that taking a thermal 


state of a two-body Hamiltonian and tracing out one particle typically leads to a state 
that is not a thermal state of a two-body Hamiltonian anymore. Therefore, the quantity 
Dk should not be considered as a measure of correlations in the quantum state, it is 
more appropriate to consider it as a measure of the complexity of the state. 


4-2.2. Characterizing the approximation — For the characterization of the information 
projection the following result is quite helpful [^. First, let g be an arbitrary 
quantum state, and gk be the information projection onto the exponential family Qk- 
Furthermore, let Mk be the set of all quantum states that have the same fc-body 
marginals as g. Mk is, contrary to Qk, a linear subspace of the space of all density 
matrices (see Fig. |^. Then, the following statements are equivalent: 

(a) The state gk is the closest state to g in Qk with respect to the relative entropy. 

(b) The state gk has the maximal entropy among all states in Mk- 

(c) The state gk is the intersection Qk ft Mk- 
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Figure 4. The information projection gk of the state g is the closest state to g within 
the exponential family Qk- Mk denotes the set of all quantum states that have the 
same fc-body marginals as g, and can also be used to characterize gk- For arbitrary 
states g' within Qk the relation D{g\\g') = D{g\\gk) + D{gk\\g') holds, which resembles 
the Pythagorean Theorem. The figure is taken from Ref. 


This equivalence can be used for many purposes. 


developing an algorithm for computing the information projection 33,34 


For example, it is useful for 
Instead of 


minimizing the relative entropy as a highly nonlinear function over Qk, one can do 
the following: One optimizes over all states in Qk with the aim to make the /c-body 
marginals the same as for the state p. The resulting algorithm converges well and 


allows the computation of the complexity measure Dk for up to six qubits 33 


Second, from the equivalences it follows that the multi-information Di can directly 
be calculated, since the closest state to g in the family Qi is the product state 
Pi = Pi <8) P 2 <8 ... ® pv built out of the reduced single-particle density matrices of 
g. Clearly, pi has the same marginals as p and maximizes the entropy. 


4-2.3. A five-qubit example — As a hnal example, let us discuss how the notion of 
exponential families can help to characterize ground states of two-body Hamiltonians. 
For that, consider the hve-qubit ring-cluster state |i? 5 ). This state is dehned to be the 
unique eigenstate fulhlling 

I.R5) = ( 18 ) 

where pi = a,^a^lla^,g 2 = cr^cr^cr^ll, pg = la^a^a^l,g 4 = and ps = 

azlla^ax- Here, the tensor product symbols have been omitted. After appropriate 
local transformations, the ring-cluster state can also be written as 

\R^) = 1- [lOOOOO) + lOOllO) - loioil) + lOllOl) + 110001) - llOlll) + 111010) + 111100)]. 

v8 

(19) 

The ring-cluster state is an example of a so-called graph state, and plays an important 
role in quantum error correction as a codeword of the hve-qubit Shor code. It was known 
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before that the state |i? 5 ) cannot be the nniqne gronnd state of a two-body Hamiltonian 
35 . This, however, leaves the question open whether it can be approximated by ground 


states of two-body Hamiltonians. For instance, for three qubits it was shown that not 
all pure states are ground states of two-body Hamiltonians, but all pure states can be 


approximated arbitrarily well by such ground states 36 


The characterization of the exponential families from the previous section can 
indeed help to prove that the state has {R^) has hnite distance to all thermal states 
of two-body Hamiltonians. For that, hrst note that the two-body marginals of the state 
|/? 5 )(i? 5 | are all maximally mixed two-qubit states. Then, one can directly hnd states 
which have the same two-body marginals, but their entropy is larger than the entropy 
the state |i? 5 )(i? 5 |. This last property is, of course, not surprising, since |i? 5 )(i? 5 | has 
as a pure state the minimal possible entropy. According to the previous section, this 
already implies that |i? 5 )(i? 5 | cannot be the thermal or ground state of any two-body 
Hamiltonian. 

Furthermore, if an arbitrary state g has a high hdelity with then the two-body 
marginals will be close to the maximally mixed states, and in addition the entropy of g 
will be small. This implies that one can hnd again states with the same marginals and 
higher entropy. Using these ideas and some detailed calculations one can prove that if 
a state fulhls 

F={R,\g\R,)>^^0M875. (20) 


then it cannot be a thermal state of a two-body Hamiltonian j^. This shows that 
the state |i? 5 ) cannot be approximated by thermal states of two-body Hamiltonians. In 
principle, this bound can also be used to prove experimentally that a given state is not 
a thermal state of a two-body Hamiltonian. 


5. Conclusion 

In conclusion we have explained several problems occurring in the analysis of 
multiparticle quantum states, ranging from systematic errors of the measurement devices 
to the characterization of ground states of two-body Hamiltonians. We believe that 
several of the explained topics are important to be addressed in the future. First, since 
the current experiments in quantum optics are getting more and more complex, advanced 
statistical methods need to be applied in order to reach solid conclusions. Second, the 
analysis of ground states and thermal states of simple Hamiltonians is relevant for 
quantum simulation and quantum control, so direct characterizations would be very 
helpful. 
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